Towards real-time multilingual multimodal speech-to-speech translation
Abstract
Speech-to-speech translation technology enables natural oral communication between speakers of different languages. Many research projects have addressed speech-to-speech translation (S2ST) technology, such as ATR [1], VERBMOBIL [2], C-STAR [3], NESPOLE! [4], BABYLON [5], GALE [6], and EU-bridge [7]. An S2ST system is normally composed of automatic speech recognition (ASR), machine translation (MT), and text-to-speech synthesis (TTS). All of these modules are corpus-based, statistical systems. In this talk, new challenges toward real-time multimodal speech-to-speech translation will be introduced.
Similar resources
Communicative Strategies and Patterns of Multimodal Integration in a Speech-to-Speech Translation System
When multilingual communication through a speech-to-speech translation system is supported by multimodal features, e.g. pen-based gestures, the following issues arise concerning the nature of the supported communication: a) to what extent does multilingual communication differ from ‘ordinary’ monolingual communication with respect to the dialogue structure and the communicative strategies used ...
Services to Support Use and Development of Speech Input for Multilingual Multimodal Applications for Mobile Scenarios
Speech is our most natural form of interaction. Developing speech input modalities for several languages, combining speech recognition and understanding, presents various difficulties. While automatic translators ease the translation of normal text, the adaptation of grammars for several languages is currently performed based on an ad hoc approach. In this paper, we present a novel service that...
The NESPOLE! Multimodal Speech-to-Speech Translation System: User Based System Improvements
This work discusses the results of two user studies aiming to evaluate the NESPOLE! speech-to-speech translation system, which provides multilingual and multimodal communication in the tourism and medical domains, allowing users to interact through the Internet by sharing maps, web pages and pen-based gestures. The purpose is to investigate the overall effectiveness of the combination...
Multilingual Mobile-Phone Translation Services for World Travelers
This demonstration introduces two new multilingual translation services for mobile phones. The first translation service provides state-of-the-art text-to-text translations of Japanese as well as English conversational spoken language in the travel domain into 17 languages using statistical machine translation technologies trained automatically from a large-scale multilingual corpus. The second...
Implementing and evaluating a multimodal and multilingual tourist guide
This paper presents the EURESCOM project MUST (MUltimodal, multilingual information Services for small mobile Terminals). The project started in February 2001 and will last until the end of 2002. Based on existing technologies and platforms, a multimodal demonstrator (the MUST tourist guide to Paris) has been implemented. This demonstrator uses speech and pen (pointing) for input, and speec...
Publication date: 2014